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It is hardly surprising that much of the early biochemical 
research was concerned with the study of proteins. Proteins 
form the class of biological macromolecules that have the 
most well-defined physicochemical properties and conse- 
quently they were generally easier to isolate and character- 
ize than nucleic acids, polysaccharides, or lipids. Further- 
more, proteins, particularly in the form of enzymes, have 
obvious biochemical functions. The central role that pro- 
teins play in biological processes has therefore been recog- 
nized since the earliest days of biochemistry. In contrast, 
the task of nucleic acids in the transmission and expression 
of genetic information was not realized until the late 1940s, 
the role of lipids in biological membranes was not appre- 
ciated until the 1 960s } and the biological functions of poly- 
saccharides are still somewhat mysterious. 



In this chapter we study the properties of the monomelic 
units of proteins, the amino acids. It is from these sub- 
stances that proteins are synthesized through processes that 
are discussed in Chapter 30, Amino acids are also energy 
metabolites and many of them are essential nutrients 
(Chapter 24). In addition, as we shall see, many amino acids 
and their derivatives are of biochemical importance in their 
own right {Section 4-3B). 

1. THE AMINO ACIDS 
OF PROTEINS 

The analyses of a vast number of proteins from almost 
every conceivable source have shown that all proteins are 
composed of the 20 "standard" amino acids listed in Table 
4-1. These substances are known as a-amino acids because, 
with the exception of proline, they have a primary amino 
group and a carboxylic acid group substituent on the same 
carbon atom (Fig. 4-1; proline has a secondary amino 
group,) 

A. General Properties 

The pA: values of the 20 "standard" a-amino acids of pn> 
teins are tabulated in Table 4- 1 . Here pK { and p^ 2 > respec- 
tively, refer to the a-carboxylic acid and a~amino groups, 
and $K R refers to the side groups with acid - base properties. 
Table 4A indicates that the pK values of the a-carboxylic 
acid groups lie in a small range around 2,2 so that above pH 
3.5 these groups are almost entirely in their carboxylate 
forms. The a-amino groups all have pK values near 9.4 and 
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FIGURE 4-1. The general structural formula for a-amino 
acids. There are 20 different R groups in the commonly 
occurring amino acids (Table 4-1). 
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FIGURE 4-2. The zwitterionic form of the a-amino acids that 
occurs at physiological pH values. 



are therefore almost entirely in their ammonium ion forms 
below pH 8.0. This leads to an important structural point: 
In the physiological pH range, both the carboxylic acid and 
the amino groups ofot-amino acids are completely ionized 
(Fig. 4-2). An amino acid can therefore act as either an acid 
or a base. Substances with this property are said to be am- 
photeric and are referred to as ampholytes (amphoteric 
electrons). In Section 4- ID, we shall delve a bit deeper 
into the acid -base properties of the amino acids. 
. Molecules that bear charged groups of opposite polarity 
are known as zwkterions or dipolar ions* The zwitterionic 
Character of the o>amino acids has been established by sev- 
ieral methods including spectroscopic measurements and 
: X-ray crystal structure determinations (in the solid state the 
a-amino acids are zwitterionic because the basic amine 
group abstracts a proton from the nearby acidic carboxylic 
add group). Because amino acids are zwitterions, their 
.physical properties are characteristic of ionic compounds. 
§ot instance, most a-amino acids have melting points near 
30O°C, whereas their nonionic derivatives usually melt 
around 100°C. Furthermore, amino acids, like other ionic 
compounds, are more soluble in polar solvents than in 
;.nonpolar solvents. Indeed, most tf-amino acids are very 
soluble in water but are largely insoluble in most organic 
solvents. 

B. Peptide Bonds 

#he a-amino acids polymerize, at least conceptually, 
through the elimination of a water molecule as is indicated 
■ In Fig. 4-3. The resulting CO — NH linkage is known as a 

peptide bond. Polymers composed of two, three, a few (3- 
v'itj), and many amino acid residues (alternatively called 
peptide units) are known, respectively, as dipeptides, tri- 

peptides, oligopeptides, and polypeptides. These sub- 
stances, however, are often referred to simply as "peptides." 



Proteins are molecules that consist of one or more polypep- 
tide chains. These polypeptides range in length from -40 to 
over 4000 amino acid residues and, since the average mass 
of an amino acid residue is M 10 D, have molecular masses 
that range from *-4 to over 440 kD. 

Polypeptides are linear polymers; that is, each amino acid 
residue is linked to its neighbors in a head- to- tail fashion 
rather than forming branched chains. This observation re- 
flects the underlying elegant simplicity of the way living 
systems construct these macromolecules for, as we shall see, 
the nucleic acids that encode the amino acid sequences of 
polypeptides are also linear polymers. T his permits the di- 
rect correspondence between the monomer (nucleotide) se- 
quence of a nucleic acid and the monomer (amino acid) 
sequence of the corresponding polypeptide without the 
added complication of specifying the positions and se- 
quences of any branching chains. 

With 20 different choices available for each amino acid 
residue in a polypeptide chain, it is easy to see that a huge 
number of different protein molecules can exist. For exam- 
ple, for dipeptides, each of the 20 different choices for the 
first amino acid residue can have 20 different choices for the 
second amino acid residue, for a total of 20 2 — 400 distinct 
dipeptides. Similarly, for tri peptides, there are 20 possibili- 
ties for each of the 400 choices of dipeptides to yield a total 
of 20 3 = 8000 different tripeptides, A relatively small pro- 
tein molecule consists of a single polypeptide chain of 100 
residues. There are 20 ,00 ~ 1.27 X 10 130 possible unique 
polypeptide chains of this length, a quantity vastly greater 
than the estimated number of atoms in the universe 
(9 X 10 78 )< Clearly, nature can have made only a tiny frac- 
tion of the possible different protein molecules. Neverthe- 
less, the various organisms on Earth collectively synthesize 
an enormous number of different protein molecules whose 
great range of physicochemical characteristics stem largely 
from the varied properties of the 20 "standard" amino acids. 
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FIGURE 4-3. The condensation of two a-amino acids to 
form a dipeptide. The peptide bond is shown in red. 
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TABLE 44. Cova^nt Stk.ct.h.s an» Ab™tk>ns of the "Sr^ W Amino Act„s o, Pro m theib 

OCCURANCE, AND THE PX VALUES OF THEIR IONIZING GROUPS 



Name, 
Three-letter Symbol, 
and One-letter Symbol 



Structural 
Formula* 



Residue 
Mass 
(t»* 



Amino adds mth nonpaiar side chains 

Glycine COO" 

G[ y H—C— H 
G I 



Alanine 
Ala 
A 

Valine 
Val 
V 

Leucine 
Lea 
L 

Isoleucine 
He 

I 

Methionine 

Met 

M 

Proline 

Pro 

P 



coo- 

i 

H—C — CH 3 
NHJ 

COO-yCHj 
H-C-CH 
NHJ" CH, 



CH, 



coo- 

I / 

H — C — CH 2 — CH 
I \ 
NH 3 f CH 3 



COO" CH 3 

H-C 
I 



NHJ 



C— CH 2 — CH, 
H 



coo- 

H-C-CH 2 -CH 7 -S-CH 3 
I 

H 2 

CpO-^i-CH, 
^C 2 J 

H N 
H 



I Phenylalanine COO 
Phe 
F 



/. Tryptophan 



e CUU 




57.0 



71.1 



99. i 



1 13.2 



113.2 



131.2 



97.1 



147.2 



186.2 



Average 
Occurence 
in Proteins 



7.2 



7.8 



6.6 



9 J 



5.3 



2.2 



52 



3.9 



1.4 



P*i 
a-COOH d 



2,35 



2.35 



2.29 



2,33 



2.32 



2.13 



1.95 



2.20 



2.46 



pA 2 



9.78 



9.8? 



9-74 



9.74 



9.76 



9.28 



10.64 



9.31 



9.41 



Side chain' 



_ — — ; — : n n ith ft i W h residue mass is siven for the neutral compound. The C„ atoms, as well as those atoms 

symbol for an undetermined or "nonstandard" amino acid ts X. 
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Name, 
Three-letter Symbol, 
and One-letter Symbol 



Residue 
Structural Mass 
Formula* (D)* 



Average 
Occurence 
in Proteins (%)* 



Amino acids with uncharged polar side chains 



Serine 

Ser 

S 



COO" 
I 

H— C— CH 2 — OH 



87. 



Threonine COO H 
Thr 
T 



H— C C~ CH, 



NHj OH 

Asparagme' COO o 

I / 
C— CH 2 — C 



Asn 
N 



\ 



Glutamine* COO" 
uin H — C—C I It — - CA U — C 



0 



I 

NH* 

Tyrosine COO" 

t I 

xyr H _ C _ CH . 

■Y I 

m 

Cysteine COO"" 



\ 
NH 2 



101.1 



114,1 



128.1 



163.2 



103.1 



Cys 



n— c— CM,- si i 



Amino acids with charged polar side chains 

Lysine COO" 
. Lys 



128.2 



K 



H— C— CH,— CH,—CH,— CH,— NHf 



mi 1 



Afgtnine COO" 



Arg H _ c _ CH _ CH — C H 2 — NH— C 

R I \ 

N'HJ ' 



NH 2 156.2 

/ 



"h 



NHJ 



Histidine 
His 



COO" 

; His h-c-ch^ 3 ij 



Aspartic acid* COO" 



/Asp 
■D 



H 

O 

/ 



H — C — CH 3 — C 
NH 3 + 



Glutamic acid 47 COO 
Glu 



137.1 



£15.1 



129.1 



6,8 



5.9 



4.3 



4.3 



3.2 



1,9 



5.9 



5,1 



2,3 



5.3 



6.3 



a-CQOU* 



2.19 



2,09 



2.14 



2.17 



2.20 



1.92 



2.16 



1.82 



1.99 



2.10 



P#2 



9.21 



9.10 



8.72 



9.13 



Side chain* 



9.21 10.46 (phenol) 



10.70 8.37 (sulfhydryl) 



9.06 10.54 (*NHf) 



8.99 12.48 (guanidino) 



9.33 6.04 (imidazole) 



9.90 3.90 (JJ-COOH) 



9.47 4.07 (y~COOH) 



